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Abstract 


Many genetic and mechanistic features distinguish the coronavirus replication machinery from that encoded by most other RNA viruses. The 
coronavirus replication/transcription complex is an assembly of viral and, most probably, cellular proteins that mediate the synthesis of both the 
unusually large (~30 kb) RNA genome and an extensive set of subgenomic mRNAs. The viral components of the complex are encoded by the giant 
replicase gene, which is expressed in the form of two polyproteins (ppla and pp1lab) that are processed into 16 cleavage products (nonstructural 
proteins 1-16). Using the combination of yeast two-hybrid screening and GST pull-down assays, we have now analyzed all potential interactions 
between SARS-Coronavirus nonstructural proteins, which may contribute to the structure and/or function of the viral replication/transcription 
complex. We demonstrate the existence of a complex network of interactions involving all 16 nonstructural proteins. Our results both confirmed 
previously described associations and identified novel heterodimerizations. The interaction map thus provides a sum of the interactions that may 
occur at some point during coronavirus RNA synthesis and provides a framework for future research. 
© 2007 Elsevier B.V. All rights reserved. 
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1. Introduction Coronaviruses are enveloped viruses with large positive- 


sense single-stranded RNA genomes ranging from 27 to 


The family Coronaviridae includes the genera Coronavirus 
and Torovirus that, together with the more distantly related 
Arteriviridae and Roniviridae families, form the order Nidovi- 
rales (for reviews see Gorbalenya et al., 2006). In 2003, a 
novel coronavirus was identified as the etiological agent of the 
Severe Acute Respiratory Syndrome (SARS; Drosten et al., 
2003; Ksiazek et al., 2003; Peiris et al., 2003) and is now known 
as the SARS-Coronavirus (SARS-CoV). 
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32 kb, the largest among the known RNA viruses. Like other 
nidoviruses, coronaviruses genomes are polycistronic with 
about two thirds being occupied by two large open reading 
frames (ORFs 1a and 1b) that specify the viral nonstructural pro- 
teins (nsps) or “replicase”. The coronavirus genome is expressed 
using a variety of regulatory mechanisms, including the synthe- 
sis and proteolytic processing of replicase polyproteins and the 
generation of an extensive nested set of subgenomic (sg) mRNAs 
to express structural and accessory proteins from the 3’-proximal 
part of the genome (for recent reviews, see Pasternak et al., 
2006; Sawicki et al., 2007). Genome expression starts with the 
translation of replicase ORFs 1a and 1b, with expression of the 
latter involving a — 1 ribosomal frameshift (Brierley et al., 1989; 
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Fig. 1. SARS-CoV replicase organization. Cleavage products and their nomenclature, and (predicted) protein functions are depicted. Abbreviations: nsp, nonstructural 
protein; AD, acidic domain; XD, X domain; SUD, SARS-CoV unique domain; PLP2, papain-like proteinase; PLnc, papain-like noncanonical; Y, nsp3-Y domain; 
TM, transmembrane domain; ssRBP, single-strand RNA binding protein; dsRBP, double-strand RNA binding protein; RdRp, RNA-dependent RNA-polymerase; 
ZD, zinc-binding domain; 2’-O-MTase, 2'-O-methyltransferase; ?, unknown function. 


Brierley and Dos Ramos, 2006) occurring with an estimated effi- 
ciency of 20-40%. In the case of SARS-CoV, ppla and pplab 
are 4382 and 7073 amino acids long, respectively (Marra et al., 
2003; Rota et al., 2003; Snijder et al., 2003; Thiel et al., 2003). 
These precursors are cleaved co- and post-translationally by two 
viral proteases, a papain-like protease (PLP2) residing in nsp3 
and the 3C-like protease (3CLpro; nsp5), to produce the mature 
replicase subunits nsp1 to nsp16 (Fig. 1). Most of these subunits 
are believed to associate with a poorly characterized membrane- 
bound complex (Bost et al., 2001; Gosert et al., 2002; Shi et al., 
1999; Snijder et al., 2006; van der Meer et al., 1999) that medi- 
ates the synthesis of genome RNA (replication) and sg mRNAs 
(transcription). The sg mRNAs are produced by a unique mech- 
anism involving discontinuous negative-strand RNA synthesis 
to produce subgenome-length templates for sg mRNA synthesis 
(Sawicki and Sawicki, 1995; reviewed in Pasternak et al., 2006; 
Sawicki et al., 2007). 

Knowledge related to the functions and interactions of coro- 
navirus nsps has remained limited thus far. nsp1 and nsp2 
are rather variable in sequence, and possibly subgroup-specific 
replicase subunits (Snijder et al., 2003; Ziebuhr et al., 2001). 
SARS-CoV nsp1 (20 kDa) was implicated in suppression of cel- 
lular gene expression by promoting host cell mRNA degradation 
(Kamitani et al., 2006) and in chemokine dysregulation (Law et 
al., 2007). SARS-CoV nsp2 (70 kDa) is dispensable for replica- 
tion in cell culture, but its deletion attenuates viral replication 
(Graham et al., 2005). SARS-CoV nsp3 is a large multidomain 
protein of 1922 amino acids (Snijder et al., 2003; Thiel et al., 
2003) that is thought to contain at least seven domains: (1) an 
N-terminal Glu-rich acidic domain (AD); (2) an X domain (XD) 
with poly(ADP-ribose) binding properties (Egloff et al., 2006; 
Saikatendu et al., 2005); (3) the SUD domain (for SARS-CoV 
Unique Domain, an insertion not found in any other coronavirus 
thus far) with a specific affinity for oligo(G)-strings (Tan et al., 
in press); (4) a papain-like protease (PLP2), recently shown to 
exhibit deubiquitinating activity (Barretto et al., 2005; Harcourt 


et al., 2004; Lindner et al., 2005; Ratia et al., 2006); (5) an 
unknown domain possibly extending the papain-like protease 
domain, termed PLnc for Papain-Like noncanonical (see below); 
(6) a transmembrane domain (Kanjanahaluethai et al., 2007) 
corresponding to the N-terminal of the Y domain; and (7) the 
remainder of the Y domain, the abbreviation “Y domain” will 
be used for this part in this study. The multi-spanning trans- 
membrane domains in nsp3, nsp4 and nsp6 are presumed to 
serve as a scaffold for the assembly of the membrane-associated 
replication/transcription complex (RTC). nsp5 (34kDa) is a 
3C-like cysteine protease (3CLpro), also known as “main pro- 
tease” (MP?°) (Anand et al., 2003; Gorbalenya et al., 1989), 
cleaving the replicase polyproteins at 11 conserved sites con- 
taining a canonical Leu-Gln (Ser, Ala, Gly) signature. nsp8 
protein (22 kDa) was recently postulated to be a processivity fac- 
tor associated with the viral RNA-dependent RNA polymerase 
(RdRp) in nsp12. Indeed, the crystal structure of a hexade- 
cameric nsp7—nsp8 complex provided a first glimpse of the 
sophisticated architecture of the coronavirus replication and 
transcription machinery (Zhai et al., 2005). The properties of 
the central channel of this complex suggest that it may encircle 
double-stranded RNA. This hypothesis was strengthened by the 
finding that nsp8 displays primase activity in an in vitro assay 
(Imbert et al., 2006) and might thus catalyse the synthesis of 
RNA primers for the primer-dependent RdRp. nsp9 (13 kDa) 
is a ssRNA-binding protein (Egloff et al., 2004; Sutton et al., 
2004) postulated to stabilise and protect nascent and template 
RNA strands during replication and/or transcription. Much like 
nsp8 and nsp9, nsp10 (18 kDa) is also a nucleic acid-binding 
protein with two zinc fingers (Joseph et al., 2006; Matthes 
et al., 2006). The phenotype of a temperature-sensitive nsp10 
mutant (Q65E) of the coronavirus mouse hepatitis virus (MHV) 
implicated nsp10 in minus-strand RNA synthesis (Sawicki et 
al., 2005; Siddell et al., 2001). The C-terminal part of nsp12 
(102 kDa) contains the poorly characterized RdRp domain that is 
significantly divergent from both cellular and viral polymerases. 
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Table 1 
SARS-CoV replicase subunits and domains used in this study 


SARS-CoV nsp/domain 


Genome position* of domains used in this study 


Position in ppla/pplab of domains used in this study 


nsp1 nt 268-801 
nsp2 nt 802-2718 
nsp3-AD nt 2719-3294 
nsp3-XD nt 3277-3786 
nsp3-SUD nt 3841-4890 
nsp3-PLP2 nt 4891-5817 
nsp3-PLne nt 5818-6975 
nsp3-Y nt 7378-8484 
nsp5 nt 9985-10902 
nsp6 nt 10903-11772 
nsp7 nt 11773-12021 
nsp8 nt 12022-12615 
nsp9 nt 12616-12954 
nsp10 nt 12955-13371 
nsp12 nt13398-16166 
nsp13 nt 16167-17969 
nsp14 nt 17970-19550 
nsp15 nt 19551-20588 
nsp16 nt 20589-21485 


1Met-Gly 180 
181Ala-Gly818 
819Ala-Leu1010 
1005GIn-Pro1 174 
1193Glu-Val1542 
1543Lys-Ser2025 
1852Thr-Pro2237 
2372Lys-Gly2739 
3241Ser-GIn3546 
3547Gly-G1n3836 
3837Ser-G1n3919 
3920Ala-Gln4 117 
4118Asn-G1n4230 
4231Ala-GIn4369 
4370Ser-G1n5301 
5302Ala-G1n5902 
5903 Ala-G1n6429 
6430Ser-GIn6775 
6776Ala-Asn7073 


@ All nucleotide numbers given refer to the SARS-CoV genome sequence (strain Frankfurt-1, GenBank accession number AY291315). 


Only one study reported polymerase activity of a purified coro- 
navirus RdRp (Cheng et al., 2005). nsp13 (67 kDa) comprises 
a C-terminal helicase domain with a 5’—3’ directionality and an 
N-terminal zinc-binding domain (Ivanov et al., 2004b; Seybert 
et al., 2000, 2005). Finally, nsp14 (58kDa), nsp15 (38 kDa) 
and nsp16 (33 kDa) contain exoribonuclease (Minskaia et al., 
2006), endoribonuclease (Bhardwaj et al., 2006; Ivanov et al., 
2004a) and predicted 2’-O-ribose methyl transferase activities 
(von Grotthuss et al., 2003), respectively, which are all essen- 
tial for efficient coronavirus RNA synthesis (Almazan et al., 
2006). In addition, as illustrated above, coronavirus replicases 
have evolved to include a number of nonessential functions that 
may provide a selective advantage in specific host cells only. 

Most of the 16 coronavirus nsps are known or thought to be 
part of the RTC, which is formed following genome translation 
and replicase polyprotein processing. As for many plus-strand 
RNA viruses (Mackenzie, 2005; Salonen et al., 2004; Snijder 
et al., 2006), it has been demonstrated that nidovirus RNA syn- 
thesis is associated with cytoplasmic membranes, in this case 
double-membrane structures that likely originate from the endo- 
plasmic reticulum. Obvious benefits of complex formation are 
the possibility to couple functions residing in different subunits 
and the sequestration of viral components. The complex and 
sophisticated coronavirus replication/transcription strategy sug- 
gests that the composition of the complex may change during the 
life cycle of the virus or that — alternatively — partially different 
complexes dedicated to, e.g. either replication or transcription 
may co-exist, probably relying on the same central RNA syn- 
thesizing catalytic subunit, the nsp12-RdRp. 

The network of interactions within the SARS-CoV RTC has 
only begun to emerge. Dual-labelling studies in SARS-CoV- 
infected cells have confirmed the colocalization of a variety 
of replicase subunits (Harcourt et al., 2004; Ivanov et al., 
2004a; Prentice et al., 2004; Snijder et al., 2006) and newly 
synthesized viral RNA (Harcourt et al., 2004). Chemical cross- 


linking experiments demonstrated that SARS-CoV nsp10 could 
be cross-linked to nsp9 (Su et al., 2006), which in turn inter- 
acts with nsp8 (Sutton et al., 2004). Very recently, a SARS-CoV 
genome-wide analysis for intraviral protein—protein interactions 
was performed (von Brunn et al., 2007). Interactions were tested 
by yeast two-hybrid system and subsequently confirmed by co- 
immunoprecipitation experiments in mammalian cells. nsp2 and 
nsp8 showed a rather large number of interactions. In this study, 
the nsp3 protein was subdivided into 2 fragments by cutting to 
the SUD domain. 

Coronavirus nsp12-RdRp is believed to play a central role in 
both replication and transcription. Here, we investigated inter- 
actions between all possible combinations of SARS-CoV nsps, 
including the nsp12-RdRp, which are potentially involved in 
the assembly and/or function of the RTC. The experiments were 
performed with in vitro translated nsps, as well as ex vivo, in Sac- 
charomyces cerevisiae expressing these nsps in the context of a 
two-hybrid system. We observed a complex network of interac- 
tions involving all 16 nsps, confirming previously documented 
associations and identifying numerous novel heterodimerization 
events, and thus providing a framework for future research. 


2. Materials and methods 
2.1. Virus and sequence 

The Frankfurt-1 isolate of SARS-CoV (GenBank accession 
number AY291315; Thiel et al., 2003) was amplified in Vero-E6 
cells and used in this study for the production of the expression 
vectors. 


2.2. Yeast two-hybrid matrix experiment 


All protocols used for the yeast two-hybrid system were 
described previously (Walhout and Vidal, 2001). MaV103 and 
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MaV203 yeast strains (leu2-3112, tryp-901, his3 A200, ade2-1, 
gal4A, gal80A, SPAL10::URA3, GAL1::lacZ, GAL1::HIS3- 
@LYS2, canlR, cyh2R), which are MATa and MATalpha, 
respectively, were used for the yeast two-hybrid matrix. RT-PCR 
products encoding different SARS-CoV proteins/domains, i.e. 
nspl, nsp2, six nsp3 subdomains (AD, XD, SUD, PLP2, PLnc, 
and Y), nsp5 to nsp10, and nsp12 to nsp16 were first cloned into 
pDONR201 using the Gateway system (Invitrogen). All con- 
structs were verified by sequence analysis. The boundaries of 
the domains used are detailed in Table 1. Using the Gateway 
system, each cDNA was subcloned into pAD and pDB vec- 
tors, which were used to transform MaV203 and MaV 103 cells, 
respectively. Diploid cells were generated by mating MaV103 
expressing genes fused to the DNA-binding domain (DB-ORF) 
and MaV203 expressing genes fused to the activation domain 
(AD-ORP). Selection was performed using selective complete 
(SC) medium lacking tryptophane and leucine. Patches rep- 
resenting potential interaction pairs were then examined for 
two-hybrid phenotypes by replica-plating onto selective plates, 
i.e. YPD (yeast extract, peptone, D-glucose) medium plate with 
a nitrocellulose filter on to perform the B-galactosidase filter 
lift assay, as well as on SC-Leu-Trp-His + 25 mM 3-AT and SC- 
Leu-Trp-Ura plates to assess the three two-hybrid phenotypes 
(B-galactosidase, His and Ura) (Walhout and Vidal, 2001). The 
matrix experiment was carried out three times, using clones 
derived from three different plasmid transformation and mating 
experiments. 


2.3. Expression vectors 


For GST fusion protein expression vectors, BamHI/EcoRI 
restriction fragments encoding the SARS-CoV PLnce domain, 
nsp8 and nsp9, and a BamHI restriction fragment encoding 
nsp12 were inserted in-frame into pGEX-4T-2 (GE Healthcare), 
to generate GST-PLnc, GST-nsp8, GST-nsp9 and GST-nsp12, 
respectively. GST-p55, encoding the HIV-1 Gag precursor, was 
constructed as described in Douaisi et al. (2004). To create 
SARS-CoV protein expression vectors, EcoRI/KpnI restriction 
fragments encoding nsp1, XD, SUD, PLP2, PLnc, Y, nsp5, nsp6, 
nsp7, nsp10, nsp16, EcoRI/Xbal restriction fragments encoding 
nsp2, nsp8, nsp9, AD, a Xbal/Notl restriction fragment encod- 
ing nsp12, and KpnI/Not!I restriction fragments encoding nsp13, 
nsp14, nsp15 were cloned into pTNT (Promega). All constructs 
were sequenced, and the proteins expressed from these plasmids 
did not carry any foreign tags, except an additional N-terminal 
Met residue. 


2.4. Protein expression in E. coli and purification 


The GST-PLnc (69kDa), GST-nsp8 (48 kDa), GST-nsp9 
(38.8 kDa), GST-nsp12 (132 kDa) and GST-p55 (80 kDa) fusion 
proteins were expressed in E. coli strain C41(DE3) (Avidis 
SA, France), which contained the pLysS plasmid (Novagen). 
Bacterial cultures were grown at 37°C in LB medium 
supplemented with ampicillin (100gml~!) and chloram- 
phenicol (17 »g ml!) until the ODgo9 reached 0.5. Protein 
expression was induced by adding 500 uM isopropyl 1-thio- 


8-pD-galactopyranoside (IPTG) and cells were incubated for 4h 
at 37°C. In the case of GST-PLnc and GST-nsp12 expression, 
however, induction was performed with 100 wM of IPTG and 
cells were incubated for 17h at 17°C. Cell pellets were resus- 
pended in PBS supplemented with 0.25 mg ml~!, of lysozyme, 
0.1 mg ml~! DNase, 10 pg ml~! RNaseA, | mM phenylmethy]- 
sulphonylfluoride (PMSF), protease inhibitor cocktail (Sigma), 
and 1% Triton X-100. The suspensions were sonicated and cen- 
trifuged at 25,000 x g for 30 min, after which 500 wl of a 50% 
slurry of glutathione-sepharose 4B beads (GE Healthcare) was 
added to the clarified supernatants. The beads were allowed to 
bind proteins for | h, washed three times with PBS, and finally 
resuspended in 500 wl of PBS. 


2.5. GST pull-down assays 


RNAs specifying SARS-CoV nsp1, nsp2, nsp3 domains (AD, 
XD, SUD, PLP2, PLnc, Y), nsp5 to nsp10, and nsp12 to nsp16 
were in vitro transcribed from the T7 promoter present in the 
pINT expression vector. Transcripts were in vitro translated 
in the presence of [*°S]methionine using the TNT T7 coupled 
reticulocyte lysate system (Promega) according to the manufac- 
turer’s instructions. In vitro binding analyses were performed as 
described previously (Dimitrova et al., 2003). Each interaction 
was independently tested three times. 


2.6. Western blot analysis 


Protein samples were resolved by SDS-PAGE, transblotted 
onto Hybond-P membranes (GE Healthcare) and probed with 
an anti-GST antibody that was kindly provided by Dr. Teillaud 
(INSERM Unité 255, Paris, France). 


3. Results and discussion 
3.1. Yeast two-hybrid screening matrix for SARS-CoV nsps 


The RTCs of mammalian positive-stranded RNA viruses, 
including coronaviruses, are commonly associated with intra- 
cellular membranes (Prentice et al., 2004; Snijder et al., 2006) 
and consist of multiple virus-encoded protein subunits as well 
as host proteins. To further characterize the molecular interac- 
tions between SARS-CoV RTC subunits, we performed a yeast 
two-hybrid matrix screen with all nsps encoded by this virus. An 
interaction was considered valid when it reproducibly produced 
a positive result for one or more of the three different yeast phe- 
notypes tested (Fig. 2a). Subsequently, as a general strategy, GST 
pull-down assays were used to confirm interactions (see below). 
Because of the large size of the membrane-associated nsp3 
(213 kDa), six nsp3 subdomains were defined (Table 1). The 
general bioinformatics approach that led to the design of nsp3 
domains was described previously (Ferron et al., 2005). Briefly, 
we performed a BLAST search against the VaZyMolO database 
with the full sequence corresponding to nsp3. The protein is 
predicted to be organized in 6 modules that are the F197 (AD) 
domain structurally confirmed by crystallography (Serrano et al., 
2007), the F200 (XD) domain structurally confirmed by crystal- 
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Fig. 2. (a) Analysis of SARS-CoV nonstructural protein interactions by Y2H matrix screen. Y2H matrix screen was performed by mating S. cerevisiae strains MaV203 
and MaV 103 containing prey and bait vectors with the respective SARS-CoV nsps on selective media. All nsps were tested against each other. Positive interactions 
in yeast are represented by squares with light blue, corresponding to single positive phenotype, azure blue corresponding to double positive phenotypes and dark 
blue corresponding to triple positive phenotypes. (b) Schematic representation of the outcome of the yeast two-hybrid matrix screen in which each SARS-CoV nsp 
was tested against all others. In view of its size, nsp3 (blue circle) was divided into a number of subdomains (see text) for the purpose of this experiment. Arrows 
indicate the interaction, from bait to prey. Thick arrows represent strong interactions (a positive result for all three phenotypes tested) and thin arrows indicate weaker 
interactions (one or two positive results). Nodes in blue and green represent nsp3 domains and ORF 1b-encoded proteins, respectively. No interaction partners were 
identified for the proteins represented on the left side of the figure. 
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lography (Egloff et al., 2006), the F209 (SUD), the F203 (PLP2) 
domain also structurally confirmed by crystallography (Ratia et 
al., 2006), the F250 (PLnc) (Tan et al., in press), and a hydropho- 
bic domain followed by the F202 (Y domain). The hydrophobic 
cluster analysis of module F250 (PLnc domain) can be inter- 
preted as a single and globular domain flanked by the PLP2 in 
N-terminal and a hydrophobic domain in C-terminal. 

A total of 19 proteins/domains were used in our screening 
matrix. Out of the 361 protein pairs tested in the yeast two-hybrid 
analysis, 15 combinations produced a positive result, accounting 
for 4% of the possible protein interactions (Fig. 2). In compari- 
son, about 6% of nonstructural protein—protein interactions were 
positive in the von Brunn et al. (2007) Y2H study. Surprisingly, 
no interaction is common to both studies. This discrepancy could 
be explained by the fact that two-hybrid false positives may arise 
in two-hybrid arrays and by the difference between the Y2H 
systems used (Uetz, 2002). With our Y2H system, to eliminate 
potential false-positive interaction, three different phenotypes 
were assessed (8-galactosidase, His and Ura) and the transcrip- 
tion from each reporter gene (LacZ, HIS3 and URA3) is driven 
by independent promoters. At last, we have considered an inter- 
action as positive if at least one phenotype in 2 out of the 3 matrix 
experiments was positive. 


3.1.1. Identification of multiple interactions for nsp3 and 
nsp14 

Two SARS-CoV proteins (nsp3 and nsp14) were found to 
be involved in interactions with 4 or more partners. Domains 
from nsp3 interacted with nsp5, nsp6, nsp12, nsp13, nsp14 and 
nsp16, and in particular for the PLnc subdomain the unexpect- 
edly large number of 5 interaction partners was identified. Since 
this domain is comprised of only 386 amino acids, it is tempt- 
ing to speculate that it might sequentially interact with these 
partners rather than with all of them simultaneously. Conse- 
quently, nsp3 may serve as one of the RTC scaffolding proteins 
by bringing several nsps into close proximity of each other. 
The presence of a transmembrane domain within nsp3, and 
the observed interaction with the nsp6 transmembrane protein, 
reinforces the hypothesis of a membrane-associated scaffolding 
function for nsp3 in the infected cell. The nsp3 central role was 
not so clear in the von Brunn et al. (2007) study because nsp3 
protein was subdivided only into two fragments (nt 2719-4431 
and nt 4885-8484), splitting the SUD domain in two. 

The nsp14 protein is an exoribonuclease playing an elusive, 
yet essential, role in the SARS-CoV life cycle (Minskaia et 
al., 2006). In addition to its interaction with nsp3, nsp14 inter- 
acted strongly with both nsp10, a protein with two zinc-fingers 
potentially involved in nucleic acid recognition, and nsp16, 
the 2’-O-ribose methyltransferase. Interestingly, the interactions 
between nsp10, nsp14 and nsp16 were the only reciprocal ones, 
meaning that they were also detected when the DB and AD 
fusion partners were swapped. It is tempting to speculate that 
these three proteins form a complex that is associated with nsp3 
and that the other interactions that were detected may reflect 
different and/or alternative complexes existing at different time 
points. The surface of interaction of nsp10 with nspl6 was 
mapped by reverse yeast two-hybrid system and in mammalian 


cells, using the bioluminescence resonance energy transfer tech- 
nology (BRET). Point mutations that abrogate the nsp10-nsp16 
interaction also inhibit the interaction between nsp10 and nsp14, 
albeit with subtle differences, suggesting that the interacting 
surface is overlapping (PL, unpublished data). Two crystal struc- 
tures were available for nsp10. One of them reveals a unique 
spherical dodecamer architecture (Su et al., 2006) and the other 
shows a monomer (Joseph et al., 2006). nsp10 protein is able to 
bind single- and double-stranded RNA and DNA without appar- 
ent sequence specificity (Joseph et al., 2006) and was suggested 
to be involved in SARS-CoV replication based on structural 
similarities with nuclear proteins (Su et al., 2006). The overall 
positive charge of the inner and outer surfaces of the dodecamer 
would be compatible with an RNA-binding function. Interest- 
ingly, nsp10 also interacts with nsp9, a single-stranded RNA 
binding protein (Egloff et al., 2004; Su et al., 2006; Sutton et al., 
2004) that in turn interacts with nsp8, the putative RNA primase 
(Imbert et al., 2006). 

In human coronavirus 229E, mutagenesis of the active site 
of the nsp14 exonuclease greatly reduced the accumulation of 
viral RNA, suggesting an essential role for this enzyme in repli- 
cation and transcription (Minskaia et al., 2006). This critical 
function of nsp14 was also supported by results obtained with a 
SARS-CoV replicon, in a study that also addressed the impact 
of mutations within or deletion of nsp15- and nsp16-coding 
sequences, clearly demonstrating that these proteins are required 
for efficient replication (Almazan et al., 2006). Finally, by inter- 
acting with nsp3 through the PLnc domain, and with nsp7, nsp9 
and nsp10, the nsp14 exoribonuclease may also be a key RTC 
scaffolding component, irrespective of its nucleic acid binding 
activity. 


3.1.2. One-on-one interactions 

A number of nsps (nsp1, nsp13, and nsp15) interacted with 
a single partner and these interactions were only detected when 
they were used as prey (Fig. 2). SARS-CoV nsp1, a small pro- 
tein (Almeida et al., 2007), unique to SARS-CoV, believed to 
be involved in host cell RNA degradation, may not directly par- 
ticipate in viral RNA synthesis. Consequently, it may be not too 
surprising that its only interaction partner was nsp9, a single- 
stranded RNA binding protein that, in turn, was not densely 
connected to other nsps (Fig. 2). 

In the case of the large nsp13 and nsp15 subunits, it is impos- 
sible at this stage to assess whether their size (or any other 
factor) may have impeded their expression or correct folding 
in yeast, thus preventing detection of protein-protein interac- 
tion in the yeast two-hybrid system. Alternatively, the toxicity 
of RNA-modifying enzymes that is often observed when they 
are expressed as recombinant proteins may have played a role. 
nsp13 and nsp15 could also belong to larger hetero-multimers, 
like the nsp7—nsp8 complex. Notably, and despite the docu- 
mented interaction of nsp8 with nsp7, nsp15 and nsp8 formed 
the sole isolated protein pair. We failed to detect nsp10 dimer- 
ization in the yeast-two hybrid system, as in the von Brunn et 
al. (2007) study, even though one of the two crystal structures 
reveals a dodecameric arrangement based on nsp10-nsp 10 inter- 
actions (Su et al., 2006). The second crystal structure shows a 
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Fig. 3. (a) Immunoblot analysis of purified, glutathione-sepharose-immobilized GST-p55, GST-PLnc, GST-nsp8, GST-nsp9, and GST-nsp12. Samples of glutathione- 
sepharose bound, bacterially expressed were analyzed by SDS-PAGE followed by immunoblotting with an anti-GST antibody. For the GST-nsp12 sample, 
material derived from 20 times more bacteria was loaded. (b) Autoradiogram from SDS-PAGE analysis of in vitro translated SARS-CoV nonstructural pro- 
teins. [°°S]Methionine-labelled samples were subjected to SDS-PAGE and autoradiography. Positions of molecular size standards are indicated on the left side, in 
kilodaltons. When additional products are visible in the gel, the band of the expected size is indicated with a star. 


monomer (Joseph et al., 2006). The dodecamer supercomplex 
may lack biological relevance, because the second zinc-finger 
lacks the last cysteine residue and the remainder of the C- 
terminal tail is disordered. Unexpectedly also, in particular 
since helicases are known to interact with RdRps in RTCs 
(Dimitrova et al., 2003; Piccininni et al., 2002), the nsp13 heli- 
case stood alone and interactions between nsp12 and nsp13 were 
not detected. 


3.1.3. Interaction orphans 

In the yeast two-hybrid matrix screening, a number of pro- 
teins did not interact with any other nsp or subdomain. These 
were nsp2 and the XD, SUD and Y domains of nsp3. It is interest- 
ing to note that nsp2, which appears to be unique to SARS-CoV, 
is dispensable for replication and transcription (Graham et 
al., 2005). Also, the ADP-ribose-1”-monophosphatase (ADRP) 
activity of the XD subdomain was found to be dispensable for 
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Fig. 4. GST pull-down analysis to identify interaction partners of nsp12-RdRp. Each panel shows an autoradiogram from a SDS-PAGE analysis of in vitro-translated 
nonstructural proteins pulled down by binding to either glutathione-sepharose-bound GST-p55 (negative control; lanes A) or GST-nsp12 (lanes B). Equal amounts 
of [3>S]methionine-labelled target were used for lanes (A and B) and the input material was also run next to the pull-down samples, as indicated for each panel. 
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Fig. 5. GST pull-down analysis using nsp8 (a), the nsp3 PLne domain (b), and nsp9 (c) as bait. Each panel shows an autoradiogram from a SDS-PAGE 12% analysis 
of in vitro-translated nsps brought down by binding (a) GST-nsp8, (b) GST-PLnc and (c) GST-nsp9. Positions of molecular size standards are indicated on the left 
side, in kilodaltons. When additional products are visible in the gel, the band of the expected size is indicated with a star. 


coronavirus replication (Putics et al., 2005). The SUD domain 
binds oligo(G) stretches and may be involved in the modulation 
of host-cell signalling pathways (Tan et al., in press). Finally, 
the Y domain of nsp3 has no obvious role in RNA synthesis. 
Therefore, it is logical to propose that they may rather play a 
role in virus-host interactions to, e.g. modulate immunity and/or 
pathogenesis. 


3.1.4, Few interactions for the RNA-dependent RNA 
polymerase (nsp12) 

Surprisingly, despite its presumed central role in RNA syn- 
thesis, the nsp12-RdRp was found to interact with only two other 
nsps, namely nsp7 and nsp3. nsp7—nsp8 could be part of a larger 
complex including nsp12 on the one hand and nsp14, nsp9 and 
nsp10 on the other (Fig. 2). These results also reflected the poten- 
tial limitations of the yeast-two-hybrid approach when analyzing 
large proteins like nsp12, prompting us to develop an alternative 
to confirm and/or extend our interaction map. 


3.2. GST pull-down analysis 


It is well-known that — for technical reasons — yeast two- 
hybrid screening may not reveal all interactions that normally 
occur in a cellular environment, owing, e.g. to steric hindrance 
of the fusion partners used in yeast two-hybrid assays or the 
inability of certain fusion proteins to be expressed and tar- 
geted to the yeast nucleus. Therefore, as an additional approach 
to identify potential interactions of the PLnc domain, nsp8, 
nsp9 and nsp12-RdRp with other SARS-CoV nsps, we car- 
ried out a set of pull-down assays using GST fusion proteins. 
This type of assay can be used to analyze the binding of a 
radiolabeled, in vitro-translated protein to an interaction part- 
ner presented in the form of a purified GST fusion protein 
bound to glutathione-sepharose beads. GST fusion proteins, as 
well as a HIV-1 gag fusion protein (GST-p55) that was used 
as negative control, were expressed in E. coli and immobilized 
on glutathione-sepharose beads (Fig. 3a). GST-p55, GST-PLnc, 
GST-nsp8 and GST-nsp9 were much more soluble in bacteria 
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Fig. 6. Schematic summary of GST pull-down results. Nodes in blue and green represent nsp3 domains and ORF 1b-encoded proteins, respectively. Proteins that did 


not interact with the bait are depicted at the left side of each diagram. 


than GST-nsp12 and could be purified in considerably larger 
amounts (Fig. 3a). Therefore, only 1/50 of a standard GST-p55, 
GST-nsp8 and GST-nsp9 preparation was used in the assays, 
whereas the complete yield of a GST-nsp12 purification was 
used (Fig. 3a). To obtain potential interaction partners, not con- 
taining any foreign tags, all SARS-CoV nsps and domains used 
before (Table 1) were in vitro translated in the presence of 
[?°S]-labelled methionine, except for nsp4 (an integral trans- 
membrane domain) and nsp11 (which is only 13 amino acids 
long) (Fig. 3b). Despite repeated attempts, we could not achieve 
the expression of nsp6 and nsp9 in the TNT T7-coupled retic- 
ulocyte lysate system. The in vitro synthesis of full-length 
proteins was confirmed by SDS-PAGE analysis (Fig. 3b). A few 
truncated products were observed for the SUD, PLP2, nsp15 
and nsp16 proteins. The AD protein corresponds to the lower 
band on the different gels, according to the protein molecu- 
lar weight marker. We do not have an explanation regarding 
the other higher band observed for AD. nsp2 protein (70 kDa) 
appears to migrate more slowly than nsp12 protein (102 kDa) 
(Fig. 3b, middle panel). This discrepancy may be related to the 
amino-terminal part of nsp2 protein which appears to be dis- 
ordered. It is documented that disordered protein might have 
an atypical migration pattern, which is variable according to 
the percentage of acrylamide sieving. GST pull-down assays 
were carried out with approximately equimolar levels of input 
material. 


3.2.1. GST pull-down experiments identify multiple 
interaction partners for nsp12-RdRp 

Pull-down assays with GST-nsp12 revealed that the SARS- 
CoV RdRp does not interact with nsp1, AD, nsp7, or nsp10 
(Figs. 4 and 6). Interactions between GST-nsp12 and nsp2, XD, 


SUD, PLnc, PLP2, and nsp8 were detected, in the absence of a 
signal for the GST-p55 negative control (Fig. 4, lanes B). Interac- 
tions were also observed with the nsp3-Y domain, nsp5, nsp13, 
nsp14, nsp15 and nsp16. nsp3-Y domain, nsp12, nsp14 and 
nsp16 also showed a weak interaction with GST-p55 (Fig. 4, 
lanes A) and GST alone (not shown), but the signal clearly 
increased when specific SARS-CoV GST-nsp fusion proteins 
were used. Therefore, these background signals were attributed 
to the large amount of the control proteins used in comparison 
with the amount of GST-nsp12. In contrast to the yeast two- 
hybrid analysis, GST-nsp12 failed to interact with nsp7. The 
same interactions were detected after RNase treatment of the in 
vitro translation products used for these binding studies, indi- 
cating that the observed interactions were not mediated by RNA 
(data not shown). 

From these GST pull-down experiments with nsp12-RdRp, 
a clear pattern emerged: the RdRp-containing subunit is appar- 
ently able to built two interactomes: one with nsp3 domains and 
the other with products encoded by ORF1b, which had been 
missed in the yeast two-hybrid experiments. Notably, nsp12 is 
also able to dimerize, perhaps providing an explanation for the 
fact that many interactions involving nsp12 were missed using 
the yeast two hybrid system (see also below). The nsp12—nsp7 
interaction could not be reproduced using the GST pull-down 
approach. The negative results for nsp1, AD, and nsp10 were 
consistent with the yeast two-hybrid data. 


3.2.2. Overlapping interaction profiles for nsp8 and nsp12 
The recently discovered primase activity of nsp8 made this 
protein a very interesting bait for an extended interaction analy- 
sis. The interaction pattern of GST-nsp8 was found to be related 
to that of GST-nsp12 (Fig. 6). Although interactions with mul- 
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tiple nsp3 subdomains (XD, SUD, PLnc and Y) were missing, 
nsp2, PLP2, nsp5 and ORF 1b-encoded products, were identi- 
fied as interactors, yielding the same interaction tree obtained 
for GST-nsp12. The bands corresponding to SUD and Y domains 
did not migrate to the expected distance but much lower further 
(Fig. 5a, left panel). For the SUD domain, a strong tendency to 
proteolytic (self-)degradation was shown by others (Tan et al., 
in press). However, this discrepancy in size was not observed 
with the GST-PLnc pull-down assay (see below). Furthermore, 
these experiments confirmed the nsp8/nsp15 interaction pre- 
viously identified by yeast two-hybrid screening (Fig. 2) and 
revealed that nsp8 dimerized, in accordance with structural, bio- 
chemical, and enzymatic data (Imbert et al., 2006; Zhai et al., 
2005). 


3.2.3. Confirmation of the large number of interactions of 
the nsp3 PLnc domain 

The yeast two-hybrid screening (Fig. 2) identified the nsp3 
PLne domain as a central interactor in the SARS-CoV RTC. 
Thus, PLnc pull-down assays were performed in order to confirm 
and perhaps extend these findings. Indeed, this alternative assay 
confirmed interactions of PLne with nsp5, nsp12, nsp13, nsp14 
and nsp16 (Figs. 5b and 6). Moreover, additional interactions 
were revealed with nsp2, several nsp3 subdomains (AD, SUD, 
PLP2 and Y), nsp7, nsp8 and nsp15. Taken together, these data 
supported the previously proposed scaffolding role for nsp3, 
in which the PLnc subdomain may fulfill a central organizing 
position. 


3.2.4, An unusually large number of interactions for nsp9 

For unknown reasons, nsp9 expression in the TNT T7- 
coupled reticulocyte lysate system remained unsuccessful. 
Therefore, a GST-nsp9 fusion protein was produced and used to 
complete our interaction screening. GST-nsp9 interacted with 
nspl, nsp2, some nsp3 domains (AD, SUD, PLP2, Y), nsp5, 
nsp7, nsp8, nsp10, nsp12, nsp13, nsp14 and nsp15 (Figs. 5c and 
6). Overall, like the nsp3 PLnc domain, nsp9 seems to have many 
partners and interacts with almost all nsps except for nsp16. The 
GST-nsp9 pull-down analysis confirmed the interactions identi- 
fied above in the yeast two-hybrid approach (nsp9 with nsp1 and 
nsp 14) and also the previously documented nsp9-nsp 10 interac- 
tion (Su et al., 2006). Both are RNA-binding proteins that may 
cooperate at some point in the virus life cycle. The nsp10/GST- 
nsp8 interaction and the nsp10/GST-PLnc interaction are much 
weaker than nsp10/GST-nsp9 interaction. 


4. Conclusion 


In this study, we investigated in vitro and in cellulo the net- 
work of interactions that must exist amongst the viral proteins 
that assemble into the SARS-CoV RTC. These data may enhance 
our understanding of coronavirus replication and transcription at 
the molecular level. We have not only confirmed interactions pre- 
viously deduced from a variety of experimental approaches, but 
have also identified a significant number of previously unknown 
(potential) interactions between SARS-CoV replicase cleavage 
products or domains. In the first place, we have provided evi- 


dence for physical interactions that may be the basis for the 
previously described co-localization of various key enzymes 
of the SARS-CoV RTC in infected cells (nsp3, nsp5, nsp12, 
nsp13 and nsp15; Snijder et al., 2006). Co-immunoprecipitation 
in mammalian 293 cells confirmed interactions of nsp8 with 
nsp2, nsp7, nsp8, nsp9, nsp12, nsp13 and nsp14 (von Brunn 
et al., 2007) and highlighted its central role within the RTC 
machinery. 

Secondly, the nsp3 PLnc domain, nsp8 and nsp12 were able 
to form stable homodimers. Thirdly, the nsp8/nsp15 interac- 
tions initially detected by yeast two-hybrid screening could 
be confirmed in GST pull-down assays. Previously described 
nsp8/nsp9/nsp 10 interactions (Su et al., 2006; Sutton et al., 2004) 
were also confirmed in the latter assay (Fig. 5, panels a and c). 
Finally, three viral interaction partners for nsp2 (nsp8, nsp9 and 
nsp12) were identified in the pull-down analysis, although the 
yeast two-hybrid analysis had not revealed these interactions. 

The coronavirus RTC engages in a variety of RNA synthesis 
processes with replication and sg mRNA synthesis each proceed- 
ing through their specific minus-strand templates. How the RTC 
accomplishes and balances these various processes remains to 
be elucidated. It is unknown whether complexes with different 
composition exist that are specifically dedicated to, e.g. repli- 
cation or transcription. Alternatively, one type of complex may 
be converted into another by the binding or release of specific 
protein partners. In this manner, such protein factors may regu- 
late the balance between replication and transcription, if carried 
out by the same complex, or direct the formation of special- 
ized “replicase“and “transcriptase“complexes. The molecular 
mechanism and factors governing this switch in coronaviruses 
are unknown. In the related arterivirus system, nsp1 was identi- 
fied as a clear example of a protein with a transcription-specific 
function. Deletion of this replicase subunit did not affect repli- 
cation, but reduced sg mRNA synthesis to undetectable levels 
(Tijms et al., 2001). In alphaviruses, temporal regulation of repli- 
case polyprotein cleavage by the nsP2 protease was found to 
be a key factor in regulating the switch from minus- to plus- 
strand RNA synthesis (Kaariainen and Ahola, 2002; Lemm et 
al., 1994), illustrating another barely explored feature of the 
nidovirus/coronavirus system: replicase subunits may act either 
after complete maturation, or as part of larger processing inter- 
mediates, or both. Thus, RTC function may be temporally and 
spatially coordinated by the activity of the protease domains 
embedded in replicase polyproteins. 

Our interactome studies confirmed that the key enzymes 
encoded by replicase ORF 1b likely exist as a complex, which 
may be organized around the dimerizing nsp12-RdRp. It is 
unknown whether dimerization is essential for RdRp activity, 
as, e.g. in the case of hepatitis C virus (Qin et al., 2002). In 
general, RdRp error frequencies are estimated to be in the order 
of 10~* (Huang et al., 2000). For many years, this number, in 
combination with the unusual size of the coronavirus genome, 
has triggered speculations about the presence of proofreading 
or other repair mechanisms. The GST-nsp12 pull-down exper- 
iments identified nsp13 to nsp16 as strong interactors (Fig. 4), 
and the fact that this pattern overlaps with that of the nsp8- 
primase is intriguing (Fig. 5a). The interaction of these proteins, 
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either simultaneously or successively, with the helicase (nsp13), 
3’ — 5’ exonuclease (nsp14), endonuclease (nsp15) and (puta- 
tive) methytransferase (nsp16) supports the hypothesis that these 
subunits may mediate successive steps of the same pathway 
(Snijder et al., 2003) and provides a basis for further experiments 
aimed at, for example, exploring the existence of a proofreading 
mechanism. It is interesting to note that the nsp13 helicase, with 
a 5’-3’ directionality, appears to lack interactions with nsp14, 
nsp15 and nsp16 (our study and von Brunn et al., 2007). In 
flaviviruses, NS3 helicase unwinds its substrates in a 3’-to-5’ 
direction and physical mapping and structural studies indicate 
that the helicase most likely precedes the NSS RdRp undergo- 
ing RNA synthesis (Malet et al., 2007). Upon isolation of an 
active coronavirus RdRp, it will be interesting to determine if 
this is also the case in coronaviruses, in view of the different 
RNA unwinding polarities (Patel and Donmez, 2006). 

In conclusion, our work confirms and defines several associ- 
ation groups among SARS-CoV nsps that will provide a basis to 
investigate cooperative enzymatic functions as well as currently 
unknown tertiary and quaternary structures. In this manner, 
these data may also contribute to the development of in vitro 
assays faithfully reproducing the different steps in coronavirus 
RNA synthesis. Furthermore, compounds affecting interactions 
between essential components of the RTC may inhibit enzymatic 
activities and may thus provide novel perspectives for antiviral 
drug design. 
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